Query Record Blocking Many Possible Matches Scoring Match Probability Potential Match Record Matching

نویسندگان

  • Andrew Borthwick
  • Maggie Soffer
چکیده

This paper seeks to describe the business requirements imposed on a record matching system along ten different dimensions. For each dimension, we present alternative requirements which different record matching clients might have. We seek to discuss the factors that might lead a client to determine that they have one requirement or another. The goal of the talk is to better prepare a client to understand their record matching needs and help them to evaluate the offerings of record matching system vendors.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The ChoiceMaker 2 Record Matching System

This paper describes the key features of an innovative record matching system called ChoiceMaker 2 developed by ChoiceMaker Technologies (CMT). We begin with an overview of the stages that a record matching system goes through to find an incoming “query record” in a database. We then consider the stages one by one: We sketch out our patent-pending process for identifying possible matches to the...

متن کامل

Optimal Identity Matching via Statistical Estimation and Information Acquisition

The accelerating growth of the Internet, along with the current stress on privacy that limits the nature of data that organizations can collect and use, has rendered it increasingly difficult to control data quality. To that end many organizations use identity matching software to help determine whether an incoming record pertains to the same subject as that of an existing record in their syste...

متن کامل

Implementing a Bayesian Approach to Record Linkage

The Census Coverage Measurement survey-based program estimated household population coverage of the 2010 Decennial Census. Calculating coverage estimates required linking survey person data to census enumerations. For record linkage research, we applied a Bayesian Latent Class Models approach to both 2010 coverage survey data and simulated household data. This paper presents our use of Base SAS...

متن کامل

The Impact of the First Goal in the Final Result of the Futsal Match

Among the many technical and tactical aspects of the behavior of players, the goals are the most studied. The goal is the key to success for teams and its analysis in all matches of a major futsal tournament (World Cup) that allows multiple assessments. The aim of this study was to analyze the impact of the first goal for the final result in the futsal match, identifying the team that scored th...

متن کامل

Improving EM Algorithm Estimates for Record Linkage Parameters

The EM algorithm can be used to estimate conditional probabilities for matching field patterns for the Fellegi-Sunter model for record linkage. The algorithm is based on a latent class model for the record pairs where one of the classes is the set of true matches. If the number of true match pairs in the data set is too small, then the EM algorithm cannot detect the correct latent class. We con...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004